A Text Retrieval System Based on Distributed Representations

نویسندگان

  • Zhe Zhao
  • Tao Liu
  • Jun Chen
  • Bofang Li
  • Xiaoyong Du
چکیده

Most text retrieval systems are essentially based on bagof-words (BOW) text representations. Despite popularity of BOW, it ignores the internal semantic meanings of words since each word is treated as an atomic unit. Recently, distributed word and text representations become increasingly popular in NLP literatures. They embed syntactic and semantic information of words and texts into low-dimensional vectors, thus overcome the weaknesses of traditional BOW representations to some extent. In this paper, we implement a text retrieval system that are totally supported by distributed representations. Our new system no longer relies on the matchings of words in queries and texts, but uses semantic similarity to judge if a text is relevant to a query and to what extent, which provides better user experience compared with traditional text retrieval systems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Image retrieval using the combination of text-based and content-based algorithms

Image retrieval is an important research field which has received great attention in the last decades. In this paper, we present an approach for the image retrieval based on the combination of text-based and content-based features. For text-based features, keywords and for content-based features, color and texture features have been used. Query in this system contains some keywords and an input...

متن کامل

Semiautomatic Image Retrieval Using the High Level Semantic Labels

Content-based image retrieval and text-based image retrieval are two fundamental approaches in the field of image retrieval. The challenges related to each of these approaches, guide the researchers to use combining approaches and semi-automatic retrieval using the user interaction in the retrieval cycle. Hence, in this paper, an image retrieval system is introduced that provided two kind of qu...

متن کامل

Learning to Match using Local and Distributed Representations of Text for Web Search

Models such as latent semantic analysis and those based on neural embeddings learn distributed representations of text, and match the query against the document in the latent semantic space. In traditional information retrieval models, on the other hand, terms have discrete or local representations, and the relevance of a document is determined by the exact matches of query terms in the body te...

متن کامل

Scientific Article Recommendation by using Distributed Representations of Text and Graph

Scientific article recommendation problem deals with recommending similar scientific articles given a query article. It can be categorized as a content based similarity system. Recent advancements in representation learning methods have proven to be effective in modeling distributed representations in different modalities like images, languages, speech, networks etc. The distributed representat...

متن کامل

Aggregation-Based Structured Text Retrieval

DEFINITION Text retrieval is concerned with the retrieval of documents in response to user queries. This is achieved by (i) representing documents and queries with indexing features that provide a characterisation of their information content, and (ii) defining a function that uses these representations to perform retrieval. Structured text retrieval introduces a finer-grained retrieval paradig...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016